Application of Translation Corresponding Tree (TCT) Annotation Schema for Chinese to Portuguese Machine Translation

Authors

  • Chi-Wai Tang
  • Ming-Chui Dong
  • Yi-Ping Li
Abstract

Research on Example Based Machine Translation (EBMT) follows three main approaches: Surface Based, Pattern Based and Structure Based. Structure Based EBMT systems, such as the SSTC approach [1], rely on two syntax parsers to analyze the translation examples, but robust syntax parsers are not always available. In addition, Chinese and Portuguese belong to different language families, and grammatical divergences exist between them. To address this weakness of Structure Based EBMT and the linguistic differences between Chinese and Portuguese, Tang and Wong [2] proposed a Portuguese to Chinese machine translation method based on the Translation Corresponding Tree (TCT), an example based knowledge annotation method for Portuguese to Chinese translation. This paper adopts the TCT annotation schema and describes knowledge base construction and translation for Chinese to Portuguese MT. It presents the additional problems that arise during Chinese analysis and the corresponding solutions. It also proposes a conversion algorithm that reuses the existing translation knowledge of the Portuguese to Chinese MT system, which is represented as TCT trees. Using this algorithm, the knowledge trees for Portuguese to Chinese translation are converted into translation knowledge that supports Chinese to Portuguese translation, so existing knowledge can be reused without reconstructing it from scratch. Based on these results, a prototype Chinese to Portuguese machine translation system was implemented, and empirical results show that it achieves a translation accuracy of 85% in the domain of Macau law statements.

Similar resources

A Flexible Example Annotation Schema: Translation Corresponding Tree Representation

This paper presents work on the task of constructing an example base from a given bilingual corpus based on the annotation schema of Translation Corresponding Tree (TCT). Each TCT describes a translation example (a pair of bilingual sentences). It represents the syntactic structure of source language sentence, and more importantly is the facility to specify the correspondences between string (b...

The Parsing Algorithm of Translation Corresponding Tree (TCT) Grammar

In machine translation (MT), parsing acts as a kernel step to analyze and acquire the syntactic information of an input sentence for the purpose to reproduce the corresponding translation in target language according to the syntactic relationships between the source and target sentences. The parsing process is guided by a set of language formalism, and the design of such algorithm is highly dep...

Hybrid Machine Aided Translation System based on Constraint Synchronous Grammar and Translation Corresponding Tree

As the demand of translating large volume of material between Portuguese and Chinese is increasing rapidly nowadays, especially in the city of Macau, the translation work becomes impractical without the support of effective tools. In order to fulfill this gap and build up a translation workbench environment for translators, a Machine Aided Translation System between these languages, PCTAssist, ...

Translation Technologies of Portuguese to Chinese Machine Translation System: PCT Assistente

This paper presents the translation technologies of Portuguese to Chinese machine translation system, PCT Assistente a hybrid translation model integrated with example-based and rule-based translation paradigms to counterbalance the intrinsic weakness of these approaches by combining the strong features of another. In the example-based model, translation examples are annotated under the schema ...

Systematic Processing of Long Sentences in Rule Based Portuguese-Chinese Machine Translation

Francisco Oliveira, Fai Wong and Iok-Sai Hong. The translation quality and parsing efficiency are often disappointing when Rule based Machine Translation systems deal with long sentences. Due to the complicated syntactic structure of the language, many ambiguous parse trees can be generated during the tr...

Publication date: 2007